Reducing the effects of noise in non-volatile memories through multiple reads

ABSTRACT

Storage elements are read multiple times and the results are accumulated and averaged for each storage element to reduce the effects of noise or other transients in the storage elements and associated circuits that may adversely affect the quality of the read. Several techniques may be employed, including: A full read and transfer of the data from the storage device to the controller device for each iteration, with averaging performed by the controller; a full read of the data for each iteration, with the averaging performed by the storage device, and no transfer to the controller until the final results are obtained; one full read followed by a number of faster re-reads exploiting the already established state information to avoid a full read, followed by an intelligent algorithm to guide the state at which the storage element is sensed. These techniques may be used as the normal mode of operation, or invoked upon exception condition, depending on the system characteristics. A similar form of signal averaging may be employed during the verify phase of programming. An embodiment of this technique would use a peak-detection scheme. In this scenario, several verify checks are performed at the state prior to deciding if the storage element has reached the target state. If some predetermined portion of the verifies fail, the storage element receives additional programming. These techniques allow the system to store more states per storage element in the presence of various sources of noise.

BACKGROUND OF THE INVENTION

[0001] 1. Field of the Invention

[0002] This invention relates generally to computer readable memory devices, and, more specifically, to methods for reducing noise when reading their information content.

[0003] 2. Background Information

[0004] In non-volatile semiconductor memories, such as EEPROMs, the amount of data stored per memory cell has been increased in order to increase storage densities. At the same time, the operating voltages of such devices have decreased to reduce power consumption. This results in a greater number states stored in a smaller range of voltage or current values. As the voltage or current separation between data states decreases, the effects of noise become more significant in the reading of these cells. For example, variations in the threshold value acceptable in a binary storage, 5 volts EEPROM cell may no longer be acceptable in a device operating at 3 volts with four or more bits storable per cell. Some consequences of noise, and methods for dealing with it, in a nonvolatile memory are described in U.S. Pat. No. 6,044,019, which is hereby incorporated by reference.

[0005] An example of noisy behaviour is shown in FIG. 1A, which is adapted from U.S. Pat. No. 6,044,019. This figure shows the variation in the current flowing through a memory cell in response to a particular set of bias conditions. The current fluctuates by an amount ΔI due to various noise effects in the memory cell and interfacing circuitry. If, for example, the memory circuit works by current sensing, as the separation between states approaches ΔI, the noise will begin to produce erroneous read values. Although the consequences of noise can be decreased by integration sensing techniques, such as those in U.S. Pat. No. 6,044,019, or treated with error correction code (ECC) or other equivalent error management, such as is described in U.S. Pat. No. 5,418,752, which is hereby incorporated herein by this reference, memories could benefit from further methods to reduce the effects of noise on memory operation.

SUMMARY OF THE INVENTION

[0006] The present invention presents methods to further reduce the effects of noise in non-volatile memories, thereby allowing the system to store more states per storage element in circuits where noise and other transients are a significant factor. The storage elements are read multiple times by sensing a parameter indicative of their state. The results are accumulated and averaged for each storage element to reduce the effects of noise in the circuits as well as other transients that may adversely affect the quality of the read.

[0007] The methods of the present invention can be implemented through several techniques. In a first set of embodiments, a full read and transfer of the data from the storage device to the controller device is performed for each iteration, with averaging performed by the controller. In a second set of embodiments, a full read of the data for each interruption is performed, but the averaging is performed within the storage device and no transfer of data to the controller takes place until the final results are sent. A third set of embodiments perform one full read followed by a number of faster re-reads by exploiting the already established state information to avoid a full read, with some intelligent algorithm to guide the state at which the storage element is sensed. This technique may be used as the normal mode of operation, or invoked upon exception condition, depending on the system characteristics.

[0008] A similar form of signal averaging may be employed during the verify phase of programming. An exemplary embodiment of this technique would use a peak-detection scheme. In this scenario, several verify checks are performed at the target state prior to deciding if the storage element has reached the final state. If some predetermined portion of the verifies fail, the storage element receives additional programming.

[0009] Additional aspects, features and advantages of the present invention are included in the following description of specific representative embodiments, which description should be taken in conjunction with the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

[0010] FIGS. 1A-C show examples of noisy response in a memory cell.

[0011]FIG. 2 illustrates the effect of noise on a program and verify operation.

[0012]FIG. 3 is a schematic block diagram of the operation of the present invention according to an exemplary embodiment.

[0013]FIGS. 4A and 4B are block diagrams of circuit embodiments of some aspects of the present invention.

[0014]FIG. 5 is a schematic representation of binary search implementation of an exemplary embodiment.

DESCRIPTION OF REPRESENTATIVE EMBODIMENTS

[0015] The storage capacity of non-volatile semiconductor memories has increased both by the reduction in the physical size of the individual components of the circuits including the memory cell and by increasing the amount of data storable in an individual memory cell. For example, devices such as those described in U.S. Pat. Nos. 5,712,180 and 6,103,573 and U.S. patent applications Ser. No. 09/505,555 filed on Feb. 17, 2000, and Ser. No. 09/667,344, filed on Sep. 22, 2000, which are all assigned to SanDisk Corporation and which are all hereby incorporated herein by this reference, can store four or more logical bits per physical floating gate storage transistor. In the case of four logical bits, this storage will require each floating gate to be able to have encoded within it one of sixteen possible memory states. Each one of these memory states corresponds to a unique value, or, more accurately, a narrow range of values, of stored charge on the floating gate which is sufficiently separated from its neighboring states' charge storage range of values to clearly differentiate it from those neighboring states as well as all the other states. This is true for both a normal read operation as well as for a verify read done as part of a programming operation.

[0016] A number of techniques for reading memory cells are described in U.S. patent application Ser. No. 09/671,793, filed on Sep. 27, 2000, and one entitled “Sense Amplifier for Multilevel Non-Volatile Integrated Memory Devices” by Shahzad Khalid, filed on Nov. 20, 2001, which are assigned to SanDisk Corporation and which are both hereby incorporated herein by this reference. The first of these applications describes reading the content of the memory cell with 7-bit or higher resolution for use in 4-bit storage. When this number of levels is contained in a 3 volt window of operating voltages as is typical in a FLASH memory, this results in approximately a 25 mV resolving capability. With this range, levels of noise formerly acceptable can now lead to the state of the cell being read with an error of one or more state levels.

[0017] FIGS. 1A-C illustrate examples of noise fluctuations in the reading of a memory cell. FIG. 1A shows the case where the value of some parameter indicative of the state of the memory cell, in this case the current I(t) fluctuates around some mean value <I>. In other embodiments, the memory can operate by reading voltage levels. If the value of the I(t) is read more or less instantaneously at regular intervals, the result would be something like what is shown in FIG. 1B. If instead the period over which the parameter indicating the state of the cell is read is several times longer than the typical time scale of the fluctuations, a relatively accurate value for the state of the cell can be determined. Some methods for accomplishing this are described in U.S. Pat. No. 6,044,019, from which FIGS. 1A and 1C are adapted and which has been incorporated by reference above.

[0018] As described in that patent, a memory cell has its source-drain current I(t) sensed by a sense amplifier. At the sense amplifier, I(t) typically has a noise component as illustrated in FIGS. 1A and 1C with noise fluctuations given by ΔI. One special feature of the sense amplifier in that invention is to process this fluctuating current by means of an integrator, which effectively produces a time-averaged current <I(t)>_(T) over a predetermined period T. An A/D module further converts the sensed analog, time-averaged current into a digital format corresponding to an output memory state Φ.

[0019]FIG. 1C illustrates a source-drain current I(t) having noise fluctuations with a characteristic period T_(ΔI), and the resulting time-averaged <I(t)>_(T) as processed by the: sense amplifier of the present invention. The sensed source-drain current of a memory cell, I(t) has a time-dependent random noise component ΔI. When I(t) is averaged over a sufficient period of time T, the noise fluctuations are substantially attenuated. The resultant error in <I(t)>_(T) is given by δI which is substantially less than ΔI. The sufficient period of time for averaging would be a period T substantially greater than a characteristic time T_(Δ1) of the noise fluctuation. This T_(ΔI) can be defined as a period where a predetermined amount of noise fluctuation has been sufficiently attenuated.

[0020] For example, in many EEPROM or flash EEPROM devices, the characteristic fluctuation time T_(ΔI) has been estimated to be from 10 nanoseconds to several hundred nanoseconds. Thus, for these typical devices, the analog averaging or integration time T for the device described in U.S. Pat. No. 6,044,019 should preferably be substantially higher than T_(ΔI). In contrast, prior art sensing of I(t) is detected more or less “instantaneously”, that is, in a time substantially less than T_(ΔI), and produces the sort of output shown in FIG. 1B depending on when the measurement is made. When the integration time T is essentially zero, as in the conventional case of no time averaging, the noise fluctuation in a sensed current is given by ΔI. As the averaging time is increased, the noise fluctuation in the time-averaged sensed current decreases due to increased phase cancellations.

[0021] Programming a memory cell is usually accomplished through a program/verify cycle, such as that described, for example, in U.S. Pat. No. 5,172,338 which is hereby incorporated by reference. In the typical process, a memory cell is pulsed with a programming voltage and its value is checked to determine whether or not it has reached the desired target state. If not, the process is then repeated until it is so verified. Since this verification is a sensing process, it will also be affected by noise. This is shown in FIG. 2.

[0022]FIG. 2 shows the influence of noise on a program/verify process. After each programming pulse, the state of the cell is checked through a sensing process. Referring to FIG. 2, the state of the measured parameter, I(t), after such a pulse, is shown as the value I(t) 201 that includes a noise component. As the state of the cell reflected by 201 is below the target value, the cell is again pulsed, as indicated by the blank interval, followed by a new sensing 202. As the state of the cell approaches the target value, the noise component can lead to an erroneous verification.

[0023] When the state of the cell is near the target value, the peaks in parameter due to noise may pass the target value, even through the cell is not yet in the desired state. This is shown by 204 where the peaks in I(t) cross the target value even though its average value, <I(t)>, is still not at the target value. If the verify read is performed at one of these peak values of 204, the cell could incorrectly verify instead of continuing on to the more accurate state represented by 205.

[0024] When sensing the state of a cell, whether as part of a verify operation or as a read operation, to obtain an accurate value for its state by the methods described above requires that the read period T be long with respect to the characteristic time T_(ΔI) of the noise fluctuation. This technique can have a number of shortcomings. Depending on how long this characteristic time is, this may require an excessively long read period when compared to the other operations of the memory and would result in a corresponding unacceptable loss of read speed. The present invention uses a short sense time, but repeats the process several times for a given read with the multiple sensings then averaged to obtain the state of the cell. For noise with a component having a long characteristic time, the multiple short sensings may require a combined time less than needed for the single read of the analog integration. Also, since a large number of memory cells are being read simultaneously, there are practical limits placed on the integrating analog circuits used in the integration technique.

[0025] Another limitation of the integration technique is that the <I(t)> value can be skewed by very short duration but large noise events. Such a short duration event can be isolated in a single read of the multiple sensing method and then removed along with other outlying values when the average is computed. Additionally, as the value of each of the sensed cells is separately determined a number of times, the values can be processed in a number of additional ways not available for the integration technique.

[0026] More particularly, the storage elements are read multiple times and the results are accumulated and digitally averaged for each storage element. This reduces the effects of noise in the circuits as well as other transient that may adversely affect the quality of the read. Several embodiments are discussed below. In one embodiment, a full read and transfer of the data from the storage device to the controller device is performed for each iteration, with averaging performed by the controller. Alternately, the device may perform a full read of the data for each iteration, but with the digital averaging performed by the storage device and no transfer to the controller until the final results are sent. Instead of performing a full read for each iteration, the memory may instead perform an initial full read followed by a number of faster re-reads exploiting the already established state information to avoid a full read, with some intelligent algorithm to guide the state at which the storage element is sensed. These techniques may be used as the normal mode of operation, or invoked upon exception condition, depending on the system characteristics. A similar form of signal averaging may be employed during the verify phase of programming. An embodiment of this technique would use a peak-detection scheme. In this scenario, several verify checks are performed at the target state prior to deciding if the storage element has reached the final state. If some predetermined portion of the verify fails, the storage element is programmed more. As the amount of noise is reduced, this technique can allow the system to store more states per storage element as compared to when noise and other transients are a significant factor.

[0027] The two read techniques, the digital averaging of multiple reads and the extended integration time for analog averaging, should be considered complementary. Their relative effectiveness depends on the characteristic frequency of the noise. Since the noise may arise from a number of different sources each with different characteristic frequencies, whether to use the first, second, or both techniques may depend upon the operating-conditions of the device. For example, if the noise is composed of a high-frequency component and a low frequency component, the integration time for each of the individual sensing operations could be taken long enough to effectively integrate out the high frequency components, while the multiple reads of the present invention could be used to reduce the effect of the low frequency component. Both the length of the integration interval and the number of re-reads can be adjusted according to the frequency characteristics of the noise. These can be set by parameters during testing and stored on the system.

[0028] An additional read technique that can be further combined with these other techniques is described in a co-pending U.S. patent entitled “Noise Reduction Technique For Transistors and Small Devices Utilizing an Episodic Agitation” by Nima Mokhlesi, Daniel C. Guterman, and Geoff Gongwer, filed concurrently with the present application and which is hereby incorporated by reference. This application describes techniques suitable when the noise has a component with a bimodal behavior.

[0029] The state of a storage element can be determined using a number of different parameters. In the examples above, the determination of a cell's stored charge level can be performed by current sensing, where the magnitude of its conduction, using fixed bias conditions, is sensed. Alternately, such determination can be made through sensing threshold voltage, where the onset of such conduction is sensed using varied steering gate bias conditions. These methods represent a couple of the more standard approaches.

[0030] Alternately, the determination could be performed dynamically by having the cells' charge-level determined driver-strength control the discharge rate of a dynamically held (by, e.g., a pre-charged capacitor) sense node. By sensing the time to reach a given discharge level, the stored charge level is determined. In this case, the parameter indicative of the cell's condition is a time. This approach is described in U.S. Pat. No. 6,222,762, which is hereby incorporated by reference, and in U.S. patent application entitled “Sense Amplifier for Multilevel Non-Volatile Integrated Memory Devices” that has been incorporated by reference above. Another alternate technique determines the state of the storage elements using frequency as the parameter, an approach described in U.S. Pat. No. 6,044,019 that has also been incorporated above.

[0031] Current sensing approaches are more fully developed in U.S. Pat. No. 5,172,338, that was included by reference above, and U.S. patent application Ser. No. 08/910,947, which is hereby incorporated by reference, and may be employed with the various embodiments described below. However, the majority of the following exposition uses the threshold voltage, V_(th), sensing approach (alternately termed voltage margining) since this improves the sensing resolution, keeps the current, and consequently the power, associated with massively parallel read operations low, and minimizes vulnerability to high bit line resistance. The V_(th) sensing, or voltage margining, approach is developed more fully in U.S. Pat. No. 6,222,762. Another technique of voltage sensing, monitoring the voltage at the source for a fixed drain voltage in response to a given gate voltage, is the source follower technique that is described, for example in U.S. patent application Ser. No. 09/671,793, incorporated by reference above.

[0032] The actual sensing in a voltage margining technique can be performed in several ways, for example by sequentially comparing a cell's threshold voltage to each of the associated reference values. For multi-state cells, it can be more efficient to exploit a cell-by-cell, data conditional binary search as described in above referenced U.S. Pat. No. 6,222,762, which, for example, is capable of determining, in parallel, each of the sensed cells' V_(th) to a one in sixteen resolution through a sequential, four-pass sensing operation. The amount of resolution required for sensing is discussed in U.S. patent application Ser. No. 09/671,793. As usual, the degree of resolution used is often a tradeoff between higher resolution, resulting in more potential “bits” beyond the minimum required 4-bit resolution of the exemplary embodiments (e.g., for use in improving memory reliability), and cost associated with more area, to sense and store these extra bits, as well as more time consumed, both due to the additional sensing passes in the binary search of these extra bits and due to the transferring of this extra information.

[0033] It should be noted that not all of the added time for the additional sensing and processing, or possibly for an increased integration time, need be directly reflected in device speed since the device can perform many of these operations in parallel. For example, if a normal read process requires an 11 pass read operation with 3 μs per pass, the present invention may add an extra half dozen or so final read passes and potentially double the integration time if there was a noise component in the 1 μs range. This will slow nominal read time for a given cell, but offer higher storage density. Increasing parallelism could reduce the effect of this increase in single cell read time. Techniques for increasing the amount of parallelism in non-volatile memories are described in U.S. patent application Ser. No. 09/766,436, filed on Jan. 19, 2001, which is hereby incorporated by reference. Although increasing the number of operations done in parallel often itself increases noise, the present techniques will also be applicable to reducing this noise.

[0034] To extract the data stored in the memory cells, this data must be both written and stored with sufficient fidelity. Consequently, to achieve such fidelity, minimizing the effects of noise are important in the verification parts of the programming operation as well as in the actual reading operation when the state of the cell is converted into a data value.

[0035]FIG. 3 is a schematic block diagram of the operation of the present invention according to an exemplary embodiment. The gate voltage on the cell 301 is set and its bit-lines are pre-charged. During the integration portion of the read phase, the voltage V(t) (including the noise component) is supplied to a sense amplifier SA 310 with an analog to digital converter A-D 311. Sense amplifier SA 310 converts this voltage into an output data state Φ_(i). The operation of the sense amp can be as described in U.S. patent application Ser. No. 09/671,793 that was incorporated by reference above, for example. As described there, the voltage level is sensed with a resolution of 2⁷=128 states to be used in discriminating cells storing 4 or 5 bits per cell. The cell is read several times, each producing a data state Φ_(i). Due to noise, and the use of an integration time shorter than the characteristic time of the noise, the Φ_(i) will be output with a spread much as is shown in FIG. 1B.

[0036] The data states Φ_(i) are then averaged. This can be done in a number of different ways, as is described below, with a simple exemplary embodiment shown in FIG. 3. The averaging circuit AVE 320 consists of an accumulator Σ_(i) 321 and a divider 323. For example, the data state could be measured eight times, the result accumulated in 321 to produce Σ_(i=1) ⁸ Φ_(i), and the divider 323 could be a shift register, which when shifted three times will divide the sum by eight. The output {overscore (Φ)} of AVE 320 is then the average read value, in this example a simple mean.

[0037] A memory system normally consists of one or more memory chips, each of which contain the actual arrays of memory cells, plus a controller chip, although in a single chip embodiment the controller function may be integrated into the same chip as that within which the array is contained. Although the read circuitry SA 310 will be located on the same chip as the memory cells, both the location and operation of the averaging circuit AVE 320 can have a number of different embodiments. More particularly, exactly how the average {overscore (Φ)} is formed from the individual reads, along with the type and location of the circuitry involved in forming this composite, have a number of possible variations. For example, the circuit AVE 320 of FIG. 3 could be placed entirely on the same circuit as the memory cell so that only the composite value {overscore (Φ)} is transfer back to the controller at the end of the process. Alternately, each of the individual Φ_(i) could be transferred to the controller and averaged there.

[0038] Although the specifics will vary with the embodiments described below, a pair of generic situations is shown in FIGS. 4A and 4B. These show a memory chip MEM 400 and a controller CONT 460. The memory chip contains the array of memory cells 401 to which are connected the row decoder circuitry 411 and column decoder circuitry 413. The read circuit 421 will contain the sense amplifiers, such as block 310 of FIG. 3, as well as any other associated circuitry. The output of the read circuit is then supplied to a set of registers 425, which may be absent depending on the embodiment. Although this discussion is mostly given in terms of a single cell, generally a large number of cells are read in parallel and the registers and other elements must support this parallelism. The other elements of the circuit are suppressed to simplify the discussion, but are described more fully in the various patents and applications that are incorporated herein by reference. For example, the circuit will also contain write circuits that are not explicitly shown, but that can be taken as incorporated into the read or other blocks, and a set of programming registers that may be the same as the read registers 429. The bus 430 connecting the memory portion 400 to the controller 460 will transfer data as well as addresses, commands, parameters, and so on between the two. The controller 460 will also have all the usual elements in addition to those specifically shown.

[0039]FIG. 4A is an embodiment wherein the individual reads, Φ_(i) (or Φ₀ and Φ′_(i) as described below), are all sent from the memory to the controller. These values may be stored temporarily in REG 425 on MEM 400 before being sent to the controller CONT 460. Once in the controller, the values may be stored there in REG 461 or other memory and the average {overscore (Φ)} formed in circuit AVE 463 before being output to the host. In the case where AVE 463 accumulates the values as they arrive from MEM 400 to form a running average, the values need not be individually stored in the REG 461.

[0040] The variation of FIG. 4B shows an example in which the composite {overscore (Φ)} is formed on the memory MEM 400 and then passed to the controller to be output to the host. The embodiment shown in FIG. 4B shows an averaging circuit like FIG. 3, consisting of an accumulator 423 and a divider 429 to form a mean {overscore (Φ)} from the individual reads. Alternate implementations on the memory chip would have other or additional elements, such as the set of registers 425 in FIG. 4A that are not shown here in order to individually store the read values.

[0041] For any of the embodiments, the basic read process of the present invention consists of concurrently supplying a parameter set p(t), such as a voltage or current, indicative of the state of the cell, to the read circuit for each of the cells of the set being read in parallel. To do this, the cells to be read are biased according to the read technique being used. The parameter p(t) will contain a noise component. The read circuit then produces the data states Φ_(i)(p) corresponding to the biasing conditions from which the composite value {overscore (Φ)} is then formed and output from the controller.

[0042] In a first set of embodiments, all of the reads of the cells are full reads; that is, if the cell is being read to a resolution of, say, 7-bits such as described in above referenced U.S. patent application Ser. No. 09/671,793, all of the reads are to this resolution, including all steps required to sense the full resolution, and passed on to the controller. These data states may be used to form a running average, such as shown in the averaging circuit 320 of FIG. 3, or else stored on the controller in the register 461 of FIG. 4 or other controller memory. By storing all of the measured data states, the controller can perform a wider variety of averaging processes. The data states can either be transferred from the memory circuit 400 as they are determined, or temporarily stored in the register 425 prior to transference.

[0043] From the measured values, a number of different composites can be formed in the controller. This composite could be a simple mean, weighted mean or mean with outlying values neglected, or a root mean square or mean based on other powers and be implemented through hardware, software, or firmware. A controller-based implementation can consequently be used without the introduction of extra circuitry, aside from possibly increasing the storage capacity. This sort of implementation does result in the largest amount of information transfer between the memory circuit and controller circuit of the described embodiments, which can be a drawback, particularly when the controller and memory circuit are on separate chips due to the increased bus traffic. A purely memory-device-based implementation would require the introduction of additional circuitry, possibly including sufficient register capacity to hold the results from the multiple reads for all of the cells being read in parallel.

[0044] One way to reduce the amount of information transferred to the controller is to do one full read on the cell, followed by a number of partial reads, with partial reads requiring fewer bits to indicate offset from baseline, where offset is limited in range. This would result is a base value Φ₀ for the data state and a number of reads Φ′_(i) showing the fluctuation about this base value due to noise. As this method produces less data to analyze (and possibly store), this saves both on the amount of data that needs to be transferred to the controller, if the averaging is performed there, and possibly on the time for the analysis to be performed, even in embodiments where the averaging is performed on the memory chip. It can also reduce the total read time.

[0045] Consider the case where the state of the memory cell is determined with a resolution of 7-bits. If the read uses a binary search technique, this would require at least seven reads at seven break points. Consequently, a full read of the state of the cell repeated four times would require a minimum of 28 sensing operations. If instead one full read is performed on the cell to determine Φ₀ followed by three reads with a 2-bit resolution to determine the fluctuations Φ′_(i) around the base Φ₀, this would result in 13 sensing-operations. In this example, the set (Φ₀,Φ′_(i)) will contain the same information as the full reads with less than half the sensing operations and less than half the data that need to be stored, transferred, or both. This of course assumes that the magnitude of the fluctuations is within the range that is measured about the base level. (Alternately, this will remove the effect of fluctuations of greater magnitude due to a saturation effect.) The range and number of the partial reads can be set by system parameters and determined during testing of the chip and may also be altered in response to system conditions. In a noisy circuit, the partial reads may need to be set to cover a larger range. For most arrangements, the best accuracy for a given number of sensing operations is obtained by having roughly the same number devoted to determining Φ₀ as to the Φ′_(i).

[0046] The binary search example may require a change of algorithm, as is shown in the schematic representation of FIG. 5 for one of the cells being read in parallel. Continuing with the previous example, to read the state of the cell with a 7-bit resolution could use 128 voltage break points. The first read in the series would determine if the cell is in the upper or lower half of the memory window, with the second read determining if the cell is in the upper or lower half of the previously determined half, and so on.

[0047] Consider such a binary search where, for example the cell is in the 64^(th) state, lying just above the level V₆₄ in FIG. 5. Without noise in either the cell or other circuit elements, it would read in the 64 ^(th) state. Consequently, it would read above the break point in the first read (501 a) and below the break point in each subsequent read (501 b-501 g). This is represented in FIG. 5 by the direction arrow that indicates whether the measure voltage is above or below the break point used to read the cell. With noise, it could read above or below this by several states. However, to determine the effects of this noise, just using the last few steps of the algorithm (say 501 f and 501 g) could never capture a value less than the first breakpoint. To properly reflect the effects of the noise would either require full 7 step reads or a change of algorithm. Consequently, a full binary search can be used to determine a base level Φ₀ from the full read algorithm followed by a second mode to determine the Φ′_(i). The multiple, faster re-reads could use a reduced binary search centered about Φ₀ or switch to a non-binary method.

[0048]FIG. 5 shows the use of a reduced binary search for the re-reads 511, 513, and 515. Again, the first seven reads (501 a-501 g) determine the base state Φ₀ is the 64^(th)state. As a fluctuation about this state could easily be below the first read level 501 a, any subsequent reads using the same algorithm would miss these. Consequently, to accurately determine the Φ′_(i) corresponding to this Φ₀, the algorithm changes to a reduced binary search centered on the state Φ₀. FIG. 5 shows three partial reads (511, 513, 515) each consisting of a 2-bit binary search (a and b) centered on the value of the full read.

[0049] In this example, the first of these results reads above both the first break point (511 a) and the second break point (511 b), corresponding to the 65^(th) state. Note that although the real result may be well above this state due to a momentary large fluctuation, a “saturation” effect occurs, in effect limiting the amount such a spike will contribute to the average. The second set of re-reads yields a result below the first breakpoint in 513 a, followed by a read above the second breakpoint in 513 b, corresponding to the 63^(rd) state. Similarly, the re-reads of 515 a and 515 b again return the 64^(th) state. Thus in this example, the first read for Φ₀, contains 7-bits of data for each cell being read, while each of the rereads contain 2-bits of data, or a total of 6-bits for the entire re-read process.

[0050] The process of FIG. 5 is one example of a read operation having a first read mode where the cell is only read once and a second mode where the cell is read multiple times. In another example, in which the measured parameter of the cell is compared to a number of reference levels in parallel, the first read could be a coarse mode where some number of the most significant bits are read followed by multiple reads for the remaining least significant bits. In a further example, during a verify process such as is shown in FIG. 2, during the early stages when the cell is far from the target value (such as in 201), the cell could be read only once, with a change of reading mode as the cell approaches the target value (such as in 204 and 205). The use of a programming technique having a coarse mode and a fine mode is described in U.S. patent application Ser. No. 09/793,370 filed Feb. 26, 2001, which is hereby incorporated by reference.

[0051] Returning to FIGS. 4a and 4 b, as already noted any of these embodiments can be implemented by transferring all of the data, either the multiple full reads Φ_(i)(p) or the set of multiple partial read and one full read (Φ₀, Φ′_(i)), to the controller 460 and processing it there, as depicted in FIG. 4a. Alternatively, all of the processing could be done on the memory chip with only the final result {overscore (Φ)} transferred to the controller and then to outside of the system, as depicted in FIG. 4b. The processing could also be split, with the reads transferred in an intermediate state. For example, the partial reads could be accumulated on the memory chip and then transferred to the controller for division or other further processing.

[0052] Another example in which the processing is split between the controller and the memory chip involves the use of reference or tracking cells, such as those described in above the referenced U.S. patent application Ser. No. 09/671,793. This application describes a method in which the tracking are cells are read with a 7-bit resolution in order to determine breakpoints for reading the data cells with a 4-bit resolution. In this method, the tracking cells could be read multiple times according to the present invention with the composite formed on the controller. Once the breakpoints are determined using the present invention, these breakpoints would then be transferred back to the memory chip to read the data cells.

[0053] One method for determining the composite on the memory chips would be to form a running average on the memory circuit 400. This could be an accumulator and division circuit corresponding to block 320 of FIG. 3. Alternately, the values could be accumulated by element 423, stored in register 425, and then transferred to controller 460 for further processing. The read values could also be first be individually stored in the register 425 and then processed by other circuitry on the memory chip as described above for the controller implemented examples.

[0054] Implementing the averaging on the memory circuit 400 will generally have the cost of introducing additional elements on the circuit that are usually not included. Depending on how the mean or other composite is formed, this could include some combination of increased register size for storing read values, the introduction of additional circuitry, division circuitry, some sort of state machine, and so on. On the other hand, the resultant reduction in the amount of information that needs to be transferred to the controller can be particularly advantageous in the program verify process for maintaining high write speed.

[0055] As described in U.S. Pat. No. 5,172,338 and other references incorporated by reference herein, a typical programming process in a floating gate type memory cell consists of pulsing the cell to change its state, reading the cell to determine if it has achieved the desired state, terminating further programming if so and if not continuing the pulse-read cycle until the cell is verified to have achieved the desired state. The effect of noise was discussed above with respect to FIG. 2. To store and accurately retrieve data in a memory requires accuracy in both the writing and reading process. Referring to FIG. 2, the mean value of 205 corresponds to the desired state; however, if the programming were stopped in response to peak values of 204 reaching above the target level, this would result in state with a lower mean being verified as correct. This error may then be compounded by noise during the read process. Consequently, reducing the consequences of noise in the verify process is equally important as the density of the states stored in each single cell increases.

[0056] The different aspects of the present invention can be incorporated into the verify process in a number of ways. The verify situation differs from the standard read process in that state of the cell is being measured against a data dependent target value of the measured parameter that the iterative programming algorithm must satisfy, as opposed to searching through a reference parameter space to determine the actual state of the cell. In other words, not as much information and processing is typically involved in program verify, only whether or not it has passed a given reference value. One embodiment of this technique would use a peak-detection scheme. Alternatively, other detection means, such as analog filtered average detection may be used. In this scenario, several verify checks are performed at the target state prior to deciding if the storage element has reached the final state. If some predetermined portion of the verifies fail, the storage element receives additional programming. For example, although the state corresponding to 204 in FIG. 2 has peaks that would cross the target value twice if multiple more or less instantaneous reads were performed on the cell, it would read over the target value no more than twice. In contrast, the state corresponding to 205 would read over the target value a majority of times.

[0057] To perform multiple reads when the state of the cell is still far from the desired state would greatly slow the process for little increase in accuracy. Consequently, in the early stages of the programming process when the cell is still far from the target value (for example, 201 in FIG. 2), the standard single read mode can be used, with a transition to the second mode as the target state is approached. This will happen after a different number of pulses for different states, with the transition occurring sooner for lower lying states. For example, the transition could be triggered by the first verify level having a peak to pass the target level (such as 204 in FIG. 2) or as part of the transition from a coarse programming mode to the fine programmed as described in U.S. patent application Ser. No. 09/793,370 incorporated above.

[0058] Another example of two-mode operation is that of the multiple-read mode which is invoked upon exception condition, with the standard, single read mode being used otherwise.

[0059] Another example of two-mode operation is that where a first mode uses a standard, single read and a second mode uses multiple reads. The multiple-read mode is invoked upon exception condition, with the single read mode being used otherwise. For example, the error correction code (ECC) is a prime indicator of a problem, or exception, in device operation. When the error level on the memory exceeds a predetermined value (for example, grater that one bit), the multiple read technique could be invoked.

[0060] An exception condition could also be based on reading of tracking or reference cells. As described above, in one embodiment the reference cells are read with a 7-bit resolution while the data cells are read only to a 4-bit resolution. Consequently, the reference cells will be more sensitive to noise. By using the reference cells to monitor the noise level, a flag to invoke the multiple read technique could be raised if the noise level is found to exceed a threshold when the tracking cells are processed.

[0061] Although the discussion so far has focussed on embodiments using a charge storing device, such as floating gate EEPROM or FLASH cells, for the memory device, it can be applied to other embodiments in which this type of noise is a problem, including magnetic and optical media. This invention can have far reaching applications in all types of device/transistor sensing, including, but not limited to, sub 0.1 um transistors, single electron transistors, organic/carbon based nano-transistors, and molecular transistors. For example, NROM and MNOS cells, such as those respectively described in U.S. Pat. No. 5,768,192 of Eitan and U.S. Pat. No. 4,630,086 of Sato et al., or magnetic RAM and FRAM cells, such as those respectively described in U.S. Pat. No. 5,991,193 of Gallagher et al. and U.S. Pat. No. 5,892,706 of Shimizu et al., all of which are hereby incorporated herein by this reference, could also be used. Using this approach, systems that are presently non-practical due to noise may become viable technologies. For these other types of storage elements, the particular mechanics of determining the parameter that reflects the state of the element may differ, for example magnetic properties would be measured in a magnetic medium and optical properties would be measured in a CD-ROM or other optical media, but the subsequent process follows readily from the examples above.

[0062] Although the various aspects of the present invention have been described with respect to specific exemplary embodiments, it will be understood that the invention is entitled to protection within the fill scope of the appended claims. 

It is claimed:
 1. A method of sensing the data content of a non-volatile memory, comprising: biasing a storage element of the non-volatile memory according to a set of operating parameters and target criteria; determining the value of a parameter indicative of the state of said storage element a plurality of times with the storage element biased according to said set of operating parameters and target criteria; and determining the data content of said storage element by forming a composite of said plurality of values of said parameter.
 2. The method of claim 1, wherein said storage element is a multi-state storage element.
 3. The method of claim 2, wherein said determining the value of a parameter is performed to yield a digital value.
 4. The method of claim 2, wherein the storage element is a charge storing device.
 5. The method of claim 4, wherein the parameter is a current.
 6. The method of claim 4, wherein the parameter is a voltage.
 7. The method of claim 4, wherein the parameter is a time.
 8. The method of claim 4, wherein the parameter is a frequency.
 9. The method of claim 4, wherein the parameter is a magnetic property.
 10. The method of claim 4, wherein the parameter is an optical property.
 11. The method of claim 2, wherein the forming a composite comprises: adding the plurality of parameter values to form a sum; and dividing the sum by the number of parameter values added.
 12. The method of claim 11, wherein the plurality of parameter values are individually stored prior to adding the plurality of parameter values.
 13. The method of claim 2, wherein the non-volatile memory comprises a memory unit containing a plurality of storage elements including said storage element and a controller, and wherein the composite is formed by the controller.
 14. The method of claim 13, wherein the values of the parameter indicative of the state of the storage element are individually stored on the controller prior to the forming a composite.
 15. The method of claim 13, wherein the values of the parameter indicative of the state of the storage element are individually stored on the memory unit prior to the forming a composite.
 16. The method of claim 2, wherein the non-volatile memory comprises a memory unit containing a plurality of storage elements including said storage element and a controller, and wherein the values of the parameter indicative of the state of said storage element are individually stored on the memory unit prior to the forming a composite and wherein the composite is formed by the memory unit.
 17. The method of claim 2, wherein said method of sensing the data content of a non-volatile memory is performed as part of the verify phase of a programming operation.
 18. The method of claim 2, wherein the forming a composite comprises forming a mean of the plurality of parameter values.
 19. The method of claim 2, wherein the forming a composite comprises forming a weighted mean of the plurality of parameter values.
 20. The method of claim 2, wherein the forming a composite comprises a peak detection scheme.
 21. The method of claim 2, wherein the forming a composite comprises omitting outlying values of the plurality of parameter values.
 22. The method of claim 2, wherein the determining the value of a parameter indicative of the state of the storage element comprises: determining a base value for the parameter; and determining variation from the base value a plurality of times.
 23. The method of claim 1, wherein said storage element is one of a plurality of storage elements read concurrently.
 24. A method of writing a data value in a non-volatile memory, comprising: altering the state of a storage element in the non-volatile memory; comparing a plurality of times a parameter indicative of the resultant state of the storage element to a reference value indicative of the data value; and determining whether to further alter the state of the storage element in response to said comparing a plurality of times.
 25. The method of claim 24, wherein said storage element is a multi-state storage element.
 26. The method of claim 25, wherein said comparing is performed using a digital numerical technique.
 27. The method of claim 25, wherein the storage element is a charge storing device.
 28. The method of claim 27, wherein the parameter is a current.
 29. The method of claim 27, wherein the parameter is a voltage.
 30. The method of claim 27, wherein the parameter is a time.
 31. The method of claim 27, wherein the parameter is a frequency.
 32. The method of claim 27, wherein the parameter is a magnetic property.
 33. The method of claim 27, wherein the parameter is an optical property.
 34. The method of claim 25, wherein said comparing comprises a peak detection method.
 35. The method of claim 25, wherein said comparing comprises an analog filtered average detection.
 36. The method of claim 25, wherein the non-volatile memory comprises a memory unit containing a plurality of storage elements including said storage element and a controller, and wherein the comparing is performed by the controller.
 37. The method of claim 25, wherein the non-volatile memory comprises a memory unit containing a plurality of storage elements including said storage element and a controller, and wherein the comparing is performed by the memory unit.
 38. The method of claim 24, wherein said storage element is one of a plurality of storage elements in which data are written concurrently.
 39. A method of sensing the data content of a non-volatile storage element, comprising: sensing the storage element in a first mode, wherein the storage element is sensed once for each of one or more first sets of bias conditions; subsequently sensing the storage element in a second mode, wherein the storage element is sensed a plurality of times for each of one or more second sets of bias conditions; and determining the data content of the storage element from a combination of the sensing in the first mode and the sensing in the second mode.
 40. The method claim 39, wherein said storage element is a multi-state storage element.
 41. The method claim 40, wherein the first mode is a coarse mode to determine an approximate data value and the second mode is a fine mode to refine the approximate data value.
 42. The method claim 40, wherein the first mode determines a first data value and the second mode senses variations about the first data value.
 43. The method claim 42, wherein the first mode is a binary search.
 44. The method claim 43, wherein the second mode is a binary search and the second sets of bias conditions are a subset of the first sets of bias conditions.
 45. The method claim 44, wherein the algorithm of the binary search of the second mode is determined based upon the first data value.
 46. The method claim 40, wherein method of sensing the data content of a non-volatile memory cell is performed as part of the verify phase of a programming operation.
 47. The method claim 46, wherein the first mode is applied during a coarse programming phase and the second mode is applied during a fine programming phase.
 48. The method claim 46, wherein the second mode is a peak detection method.
 49. A non-volatile memory comprising: an array of storage elements; read circuitry coupled to the array to provide a parameter associated with the state of a cell contained therein; a sense amplifier coupled to the read circuitry to determine the value of the parameter; and averaging circuitry coupled to the sense amplifier for forming a composite value for the single cell formed from multiple independently determined values of the parameter supplied from the sense amplifier.
 50. The memory of claim 49, wherein the parameter is a current level.
 51. The memory of claim 49, wherein the parameter is a voltage level.
 52. The memory of claim 49, wherein the parameter is a time.
 53. The memory of claim 49, wherein the parameter is a frequency.
 54. The memory of claim 49, wherein the parameter is a magnetic property.
 55. The memory of claim 49, wherein the parameter is an optical property.
 56. The memory of claim 49, wherein the averaging circuitry comprises an adder to produce a sum of the multiple independently determined values and a division circuit for dividing the sum by the number of independently determined values added to produce the sum.
 57. The memory of claim 49, wherein the memory comprises a storage section that includes the array, read circuitry, and sense amplifier and a controller section that includes the averaging circuitry.
 58. The memory of claim 57, wherein the controller section further includes a plurality of registers wherein the multiple independently determined values of the parameter supplied from the sense amplifier are stored prior to being supplied to the averaging circuitry.
 59. The memory of claim 57, wherein the storage section further includes a plurality of registers wherein the multiple independently determined values of the parameter supplied from the sense amplifier are stored prior to being supplied to the averaging circuitry.
 60. The memory of claim 49, wherein the memory comprises a controller section and a storage section, the storage section including the array, read circuitry, sense amplifier and the averaging circuitry.
 61. The memory of claim 60, wherein the storage section further includes a plurality of registers wherein the multiple independently determined values of the parameter supplied from the sense amplifier are stored prior to being supplied to the averaging circuitry.
 62. The memory of claim 49, further comprising: programming circuitry coupled to the array for changing the state of the storage elements contained therein and coupled to the averaging circuitry for receiving the composite value, wherein the composite value is used during the verify phase of a programming process.
 63. A non-volatile memory, comprising: means for simultaneously biasing one or more storage elements of the non-volatile memory according to a set of operating parameters and target criteria; means for determining the value of a parameter indicative of the state of each of said storage elements a plurality of times with the storage elements biased according to said set of operating parameters and target criteria; and means for determining the data content of said storage elements by forming for each of said storage elements a composite of said corresponding plurality of values of said parameter. 